GOF OF IRT MODELS 1 Goodness of fit assessment of item response theory models
نویسنده
چکیده
The article provides an overview of goodness of fit assessment methods for item response theory (IRT) models. It is now possible to obtain accurate p-values of the overall fit of the model if bivariate information statistics are used. Several alternative approaches are described. As the validity of inferences drawn on the fitted model depends on the magnitude of the misfit, if the model is rejected it is necessary to assess the goodness of approximation. With this aim in mind, a class of Root Mean Squared Error of Approximation (RMSEA) is described, which makes it possible to test whether the model misfit is below a specific cut-off value. Also, regardless of the outcome of the overall goodness of fit assessment, a piecewise assessment of fit should be performed to detect parts of the model whose fit can be improved. A number of bivariate statistics for this purpose are described, including a mean and variance correction to Pearson’s X statistic applied to each bivariate subtable separately, and the use of z-statistics for residual crossproducts.
منابع مشابه
Goodness-of-Fit Assessment of Item Response Theory Models
The article provides an overview of goodness-of-fit assessment methods for item response theory (IRT) models. It is now possible to obtain accurate p-values of the overall fit of the model if bivariate information statistics are used. Several alternative approaches are described. As the validity of inferences drawn on the fitted model depends on the magnitude of the misfit, if the model is reje...
متن کاملAssessing IRT Model-Data Fit for Mixed Format Tests
This study examined various model combinations and calibration procedures for mixed format tests under different item response theory (IRT) models and calibration methods. Using real data sets that consist of both dichotomous and polytomous items, nine possibly applicable IRT model mixtures and two calibration procedures were compared based on traditional and alternative goodnessof-fit statisti...
متن کاملAn Updated Review of Goodness of Fit Tests Based on Entropy
Different approaches to goodness of fit (GOF) testing are proposed. This survey intends to present the developments on Goodness of Fit based on entropy during the last 50 years, from the very first origins until the most recent advances for different data and models. Goodness of fit tests based on Shannon entropy was started by Vasicek in 1976 and were continued by many authors. In this paper, ...
متن کاملCLASSICAL TEST THEORY vs. ITEM RESPONSE THEORY An evaluation of the theory test in the Swedish driving-license test
The Swedish driving-license test consists of a theory test and a practical road test. The aim of this paper is to evaluate which Item Response Theory (IRT) model among the one (1PL), two (2PL) and three (3PL) parameter logistic IRT models that is the most suitable to use when evaluating the theory test in the Swedish driving-license test. Further, to compare the chosen IRT model with the indice...
متن کاملIRT-FIT: SAS® Macros for Fitting Item Response Theory (IRT) Models
Psychometrics has recently seen the development of complex measurement models to better represent test and item data. Item Response Theory (IRT), in particular, comprises a set of non-linear latent variable models that appear to have several conceptual and empirical properties that make them more valuable in practice than classical test theory methods. However, IRT-based models typically requir...
متن کامل